Multiagent Credit Assignment in a Team of Cooperative Q-Learning Agents with a Parallel Task

نویسندگان

  • Ahad Harati
  • Majid Nili Ahmadabadi
چکیده

Traditionally in many multiagent reinforcement learning researches, qualifying each individual agent’s behavior is responsibility of environment’s critic. However, in most practical cases, critic is not completely aware of effects of all agents’ actions on the team performance. Using agents’ learning history, it is possible to judge the correctness of their actions. To do so, we use team common credit besides some suitable criteria. This way an internal critic distributes the environment’s reinforcement among the agents. Continuing our previous research [1], in this paper three such criteria, named Certainty, Normal Expertness and Relative Normal Expertness, for a team of agents with a parallel task are introduced and compared. It is experimentally shown that these criteria can be used to learn from a common team credit in reasonable time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

D++: Structural credit assignment in tightly coupled multiagent domains

Autonomous multiagent teams can be used in complex exploration tasks to both expedite the exploration and improve the efficiency. However, use of multiagent systems presents additional challenges. Specifically, in domains where the agents' actions are tightly coupled, coordinating multiple agents to achieve cooperative behavior at the group level is difficult. In this work, we demonstrate that ...

متن کامل

Multi-objective multiagent credit assignment in reinforcement learning and NSGA-II

Multiagent systems have had a powerful impact on the real world. Many of the systems it studies (air traffic, satellite coordination, rover exploration) are inherently multi-objective, but they are often treated as single-objective problems within the research. A key concept within multiagent systems is that of credit assignment: quantifying an individual agent’s impact on the overall system pe...

متن کامل

Using communication to reduce locality in distributed multiagent learning

This paper attempts to bridge the elds of machine learning, robotics, and distributed AI. It discusses the use of communication in reducing the undesirable eeects of locality in fully distributed multi-agent systems with multiple agents/robots learning in parallel while interacting with each other. Two key problems, hidden state and credit assignment, are addressed by applying local undirected ...

متن کامل

A Multiagent Reinforcement Learning algorithm to solve the Community Detection Problem

Community detection is a challenging optimization problem that consists of searching for communities that belong to a network under the assumption that the nodes of the same community share properties that enable the detection of new characteristics or functional relationships in the network. Although there are many algorithms developed for community detection, most of them are unsuitable when ...

متن کامل

Quicker Q-Learning in Multi-Agent Systems

Multi-agent learning in Markov Decisions ProbK i s chanenging because of the presence ot two credit assignment problems: 1) How to credit an action taken at time step t for rewards received at t’ > t ; and 2 ) How to credit an action taken by agent z considering the system reward is a function of the actions of all the agents. The first credit assignment problem is typically addressed with temp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002